Automatic identification and skew estimation of text lines in real scene images

نویسندگان

  • Stefano Messelodi
  • Carla Maria Modena
چکیده

A method for the automatic localization of text embedded in complex images is proposed. It permits to detect the spatial position and the skew of the text lines which are present in the scene and to return a binary representation of each text line. Strenghts of the algorithm are independece of text skew and of presence of connected text. After a preprocessing step the input image is segmented in order to obtain a set of connected components which represent the basic objects of the algorithm. Several heuristics are proposed to characterize text objects which depend both on the geometrical features of single components and on the geometrical and spatial relations among components. According to these heuristics several components are discarded and the retained ones are grouped into text lines candidates by means of a divisive hierachical clustering procedure. In the experimental session we describe the application of the algorithm to the extraction of text lines from the images of 100 book covers. Results about skew estimation are also reported.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Character Skew Rectification in Scene Text Images

We present an efficient method for character skew rectification in scene text images. The method is based on a novel skew estimators, which exploit intuitive glyph properties and which can be efficiently computed in a linear time. The estimators are evaluated on a synthetically generated data (including Latin, Cyrillic, Greek, Runic scripts) and real scene text images, where the skew rectificat...

متن کامل

Part-Based Skew Estimation for Mathematical Expressions

We propose a novel method for the skew estimation on text images containing mathematical expressions which can be applied to various characters layouts. Current OCR systems are not capable of recognizing skewed characters in images correctly, and hence skew correction in such images is essential for character recognition. Conventionally methods such as projection profile methods, Hough transfor...

متن کامل

Skew Detection from Natrual Scene Images: A Review

Natural scene images are generally captured with portable devices such as mobile phone cameras. Scene images contains text information as part of captured scene. Scene image text poses difficultly in processing as compared to document text due to complexity of scene and open environment conditions. Scene images usually suffer from skew deformation due to inherent nature of portable capturing de...

متن کامل

An Integrated System for Handwritten Document Image Processing

In this paper we attempt to face common problems of handwritten documents such as nonparallel text lines in a page, hill and dale writing, slanted and connected characters. Towards this end an integrated system for document image preprocessing is presented. This system consists of the following modules: skew angle estimation and correction, line and word segmentation, slope and slant correction...

متن کامل

Automatic detection and recognition of Malayalam text from natural scene images

In this paper we describe a very simple and efficient method for the détection and recognition of the Malayalam text from colour natural scene images taken by a mobile phone camera. Malayalam text detection, skew correction of the detected text ,text segmentation and character recognition are the important steps in text understanding from natural scene images. Text understanding in natural scen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Pattern Recognition

دوره 32  شماره 

صفحات  -

تاریخ انتشار 1999